Overview

Dataset statistics

Number of variables57
Number of observations15120
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.6 MiB
Average record size in memory456.0 B

Variable types

BOOL44
NUM13

Reproduction

Analysis started2020-08-31 10:24:08.354471
Analysis finished2020-08-31 10:24:48.702529
Versionpandas-profiling v2.6.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml
Id is highly correlated with df_indexHigh Correlation
df_index is highly correlated with IdHigh Correlation
Horizontal_Distance_To_Hydrology has 1590 (10.5%) zeros Zeros
Vertical_Distance_To_Hydrology has 1890 (12.5%) zeros Zeros

Variables

df_index
Real number (ℝ≥0)

HIGH CORRELATION
UNIFORM
UNIQUE
Distinct count15120
Unique (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7559.5
Minimum0
Maximum15119
Zeros1
Zeros (%)< 0.1%
Memory size118.2 KiB

Quantile statistics

Minimum0
5-th percentile755.95
Q13779.75
median7559.5
Q311339.25
95-th percentile14363.05
Maximum15119
Range15119
Interquartile range (IQR)7559.5

Descriptive statistics

Standard deviation4364.91237
Coefficient of variation (CV)0.5774075495
Kurtosis-1.2
Mean7559.5
Median Absolute Deviation (MAD)3780
Skewness0
Sum114299640
Variance19052460
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 15119.], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2047 1 < 0.1%
 
613 1 < 0.1%
 
8833 1 < 0.1%
 
10880 1 < 0.1%
 
4727 1 < 0.1%
 
6774 1 < 0.1%
 
629 1 < 0.1%
 
2676 1 < 0.1%
 
12915 1 < 0.1%
 
14962 1 < 0.1%
 
Other values (15110) 15110 99.9%
 
ValueCountFrequency (%) 
0 1 < 0.1%
 
1 1 < 0.1%
 
2 1 < 0.1%
 
3 1 < 0.1%
 
4 1 < 0.1%
 
ValueCountFrequency (%) 
15119 1 < 0.1%
 
15118 1 < 0.1%
 
15117 1 < 0.1%
 
15116 1 < 0.1%
 
15115 1 < 0.1%
 

Id
Real number (ℝ≥0)

HIGH CORRELATION
UNIFORM
UNIQUE
Distinct count15120
Unique (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7560.5
Minimum1
Maximum15120
Zeros0
Zeros (%)0.0%
Memory size118.2 KiB

Quantile statistics

Minimum1
5-th percentile756.95
Q13780.75
median7560.5
Q311340.25
95-th percentile14364.05
Maximum15120
Range15119
Interquartile range (IQR)7559.5

Descriptive statistics

Standard deviation4364.91237
Coefficient of variation (CV)0.5773311779
Kurtosis-1.2
Mean7560.5
Median Absolute Deviation (MAD)3780
Skewness0
Sum114314760
Variance19052460
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[1.000e+00 1.512e+04], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2047 1 < 0.1%
 
6758 1 < 0.1%
 
14978 1 < 0.1%
 
8833 1 < 0.1%
 
10880 1 < 0.1%
 
4727 1 < 0.1%
 
6774 1 < 0.1%
 
629 1 < 0.1%
 
2676 1 < 0.1%
 
12915 1 < 0.1%
 
Other values (15110) 15110 99.9%
 
ValueCountFrequency (%) 
1 1 < 0.1%
 
2 1 < 0.1%
 
3 1 < 0.1%
 
4 1 < 0.1%
 
5 1 < 0.1%
 
ValueCountFrequency (%) 
15120 1 < 0.1%
 
15119 1 < 0.1%
 
15118 1 < 0.1%
 
15117 1 < 0.1%
 
15116 1 < 0.1%
 

Elevation
Real number (ℝ≥0)

Distinct count1665
Unique (%)11.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2749.322553
Minimum1863
Maximum3849
Zeros0
Zeros (%)0.0%
Memory size118.2 KiB

Quantile statistics

Minimum1863
5-th percentile2117
Q12376
median2752
Q33104
95-th percentile3397
Maximum3849
Range1986
Interquartile range (IQR)728

Descriptive statistics

Standard deviation417.6781873
Coefficient of variation (CV)0.151920402
Kurtosis-1.082115791
Mean2749.322553
Median Absolute Deviation (MAD)356.6681965
Skewness0.07563970694
Sum41569757
Variance174455.0682
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[1863. 1897. 1999.5 2024.5 2101.5 ... 3449.5 3480.5 3528. 3576.5 3849. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2290 25 0.2%
 
2830 25 0.2%
 
3371 24 0.2%
 
3244 23 0.2%
 
2820 23 0.2%
 
2955 23 0.2%
 
2795 23 0.2%
 
2952 23 0.2%
 
2962 22 0.1%
 
2304 22 0.1%
 
Other values (1655) 14887 98.5%
 
ValueCountFrequency (%) 
1863 1 < 0.1%
 
1874 1 < 0.1%
 
1879 1 < 0.1%
 
1888 1 < 0.1%
 
1889 2 < 0.1%
 
ValueCountFrequency (%) 
3849 2 < 0.1%
 
3848 1 < 0.1%
 
3846 2 < 0.1%
 
3844 1 < 0.1%
 
3842 1 < 0.1%
 

Aspect
Real number (ℝ≥0)

Distinct count361
Unique (%)2.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean156.6766534
Minimum0
Maximum360
Zeros110
Zeros (%)0.7%
Memory size118.2 KiB

Quantile statistics

Minimum0
5-th percentile13
Q165
median126
Q3261
95-th percentile344
Maximum360
Range360
Interquartile range (IQR)196

Descriptive statistics

Standard deviation110.0858014
Coefficient of variation (CV)0.7026305386
Kurtosis-1.150244484
Mean156.6766534
Median Absolute Deviation (MAD)95.39405236
Skewness0.450935294
Sum2368951
Variance12118.88367
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 0.5 43.5 44.5 45.5 ... 250.5 282.5 306.5 359.5 360. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
45 117 0.8%
 
0 110 0.7%
 
90 109 0.7%
 
63 89 0.6%
 
76 87 0.6%
 
27 82 0.5%
 
315 81 0.5%
 
75 80 0.5%
 
108 79 0.5%
 
117 78 0.5%
 
Other values (351) 14208 94.0%
 
ValueCountFrequency (%) 
0 110 0.7%
 
1 48 0.3%
 
2 50 0.3%
 
3 54 0.4%
 
4 51 0.3%
 
ValueCountFrequency (%) 
360 2 < 0.1%
 
359 33 0.2%
 
358 47 0.3%
 
357 58 0.4%
 
356 50 0.3%
 

Slope
Real number (ℝ≥0)

Distinct count52
Unique (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean16.5015873
Minimum0
Maximum52
Zeros5
Zeros (%)< 0.1%
Memory size118.2 KiB

Quantile statistics

Minimum0
5-th percentile5
Q110
median15
Q322
95-th percentile32
Maximum52
Range52
Interquartile range (IQR)12

Descriptive statistics

Standard deviation8.453926762
Coefficient of variation (CV)0.5123099134
Kurtosis-0.2383101358
Mean16.5015873
Median Absolute Deviation (MAD)6.936201814
Skewness0.5236583383
Sum249504
Variance71.4688777
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 0.5 1.5 2.5 3.5 ... 39.5 41.5 45.5 46.5 52. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
11 740 4.9%
 
10 739 4.9%
 
13 717 4.7%
 
14 699 4.6%
 
12 677 4.5%
 
9 664 4.4%
 
15 664 4.4%
 
16 640 4.2%
 
17 598 4.0%
 
8 574 3.8%
 
Other values (42) 8408 55.6%
 
ValueCountFrequency (%) 
0 5 < 0.1%
 
1 78 0.5%
 
2 134 0.9%
 
3 210 1.4%
 
4 305 2.0%
 
ValueCountFrequency (%) 
52 1 < 0.1%
 
50 1 < 0.1%
 
49 5 < 0.1%
 
48 1 < 0.1%
 
47 3 < 0.1%
 

Horizontal_Distance_To_Hydrology
Real number (ℝ≥0)

ZEROS
Distinct count400
Unique (%)2.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean227.1957011
Minimum0
Maximum1343
Zeros1590
Zeros (%)10.5%
Memory size118.2 KiB

Quantile statistics

Minimum0
5-th percentile0
Q167
median180
Q3330
95-th percentile631
Maximum1343
Range1343
Interquartile range (IQR)263

Descriptive statistics

Standard deviation210.0752957
Coefficient of variation (CV)0.9246446774
Kurtosis2.803984388
Mean227.1957011
Median Absolute Deviation (MAD)160.2756468
Skewness1.488052491
Sum3435199
Variance44131.62986
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 15. 36. 87.5 92.5 ... 793.5 899.5 903. 1188.5 1343. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 1590 10.5%
 
30 1207 8.0%
 
150 497 3.3%
 
60 490 3.2%
 
42 452 3.0%
 
67 411 2.7%
 
85 381 2.5%
 
108 361 2.4%
 
90 284 1.9%
 
120 283 1.9%
 
Other values (390) 9164 60.6%
 
ValueCountFrequency (%) 
0 1590 10.5%
 
30 1207 8.0%
 
42 452 3.0%
 
60 490 3.2%
 
67 411 2.7%
 
ValueCountFrequency (%) 
1343 1 < 0.1%
 
1318 1 < 0.1%
 
1294 1 < 0.1%
 
1261 2 < 0.1%
 
1260 2 < 0.1%
 

Vertical_Distance_To_Hydrology
Real number (ℝ)

ZEROS
Distinct count423
Unique (%)2.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean51.07652116
Minimum-146
Maximum554
Zeros1890
Zeros (%)12.5%
Memory size118.2 KiB

Quantile statistics

Minimum-146
5-th percentile-4
Q15
median32
Q379
95-th percentile176
Maximum554
Range700
Interquartile range (IQR)74

Descriptive statistics

Standard deviation61.23940613
Coefficient of variation (CV)1.198973711
Kurtosis3.403498704
Mean51.07652116
Median Absolute Deviation (MAD)46.86993488
Skewness1.53777568
Sum772277
Variance3750.264863
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-146. -82.5 -43.5 -26.5 -15.5 ... 232.5 264.5 320. 407. 554. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 1890 12.5%
 
5 217 1.4%
 
3 206 1.4%
 
4 200 1.3%
 
8 198 1.3%
 
7 182 1.2%
 
10 176 1.2%
 
9 166 1.1%
 
2 165 1.1%
 
6 162 1.1%
 
Other values (413) 11558 76.4%
 
ValueCountFrequency (%) 
-146 1 < 0.1%
 
-134 1 < 0.1%
 
-123 1 < 0.1%
 
-115 1 < 0.1%
 
-114 1 < 0.1%
 
ValueCountFrequency (%) 
554 1 < 0.1%
 
547 2 < 0.1%
 
411 1 < 0.1%
 
403 1 < 0.1%
 
401 1 < 0.1%
 

Horizontal_Distance_To_Roadways
Real number (ℝ≥0)

Distinct count3250
Unique (%)21.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1714.023214
Minimum0
Maximum6890
Zeros3
Zeros (%)< 0.1%
Memory size118.2 KiB

Quantile statistics

Minimum0
5-th percentile242
Q1764
median1316
Q32270
95-th percentile4635.1
Maximum6890
Range6890
Interquartile range (IQR)1506

Descriptive statistics

Standard deviation1325.066358
Coefficient of variation (CV)0.7730737525
Kurtosis1.022419366
Mean1714.023214
Median Absolute Deviation (MAD)1030.750914
Skewness1.247810678
Sum25916031
Variance1755800.854
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 87.5 114. 130.5 142. ... 4355.5 5320.5 6028. 6367. 6890. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
150 88 0.6%
 
120 56 0.4%
 
390 47 0.3%
 
618 45 0.3%
 
1110 43 0.3%
 
700 41 0.3%
 
108 38 0.3%
 
1273 37 0.2%
 
900 37 0.2%
 
212 37 0.2%
 
Other values (3240) 14651 96.9%
 
ValueCountFrequency (%) 
0 3 < 0.1%
 
30 15 0.1%
 
42 5 < 0.1%
 
60 11 0.1%
 
67 13 0.1%
 
ValueCountFrequency (%) 
6890 1 < 0.1%
 
6836 1 < 0.1%
 
6811 1 < 0.1%
 
6766 1 < 0.1%
 
6679 1 < 0.1%
 

Hillshade_9am
Real number (ℝ≥0)

Distinct count176
Unique (%)1.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean212.7042989
Minimum0
Maximum254
Zeros1
Zeros (%)< 0.1%
Memory size118.2 KiB

Quantile statistics

Minimum0
5-th percentile151
Q1196
median220
Q3235
95-th percentile250
Maximum254
Range254
Interquartile range (IQR)39

Descriptive statistics

Standard deviation30.56128689
Coefficient of variation (CV)0.143679686
Kurtosis1.218810484
Mean212.7042989
Median Absolute Deviation (MAD)24.04601365
Skewness-1.093680561
Sum3216089
Variance933.9922561
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 79. 98.5 112.5 122.5 ... 218.5 233.5 239.5 253.5 254. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
226 279 1.8%
 
229 269 1.8%
 
224 265 1.8%
 
228 261 1.7%
 
230 260 1.7%
 
233 248 1.6%
 
223 245 1.6%
 
219 242 1.6%
 
231 239 1.6%
 
225 236 1.6%
 
Other values (166) 12576 83.2%
 
ValueCountFrequency (%) 
0 1 < 0.1%
 
58 1 < 0.1%
 
59 2 < 0.1%
 
65 1 < 0.1%
 
73 1 < 0.1%
 
ValueCountFrequency (%) 
254 190 1.3%
 
253 200 1.3%
 
252 189 1.2%
 
251 174 1.2%
 
250 192 1.3%
 

Hillshade_Noon
Real number (ℝ≥0)

Distinct count141
Unique (%)0.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean218.9656085
Minimum99
Maximum254
Zeros0
Zeros (%)0.0%
Memory size118.2 KiB

Quantile statistics

Minimum99
5-th percentile175
Q1207
median223
Q3235
95-th percentile250
Maximum254
Range155
Interquartile range (IQR)28

Descriptive statistics

Standard deviation22.80196554
Coefficient of variation (CV)0.1041349174
Kurtosis1.153484179
Mean218.9656085
Median Absolute Deviation (MAD)17.73523558
Skewness-0.9532317075
Sum3310760
Variance519.9296327
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 99. 131.5 146.5 153.5 167.5 ... 232.5 236.5 248.5 253.5 254. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
225 327 2.2%
 
229 324 2.1%
 
226 320 2.1%
 
224 313 2.1%
 
230 311 2.1%
 
223 303 2.0%
 
232 298 2.0%
 
222 297 2.0%
 
228 294 1.9%
 
218 293 1.9%
 
Other values (131) 12040 79.6%
 
ValueCountFrequency (%) 
99 4 < 0.1%
 
102 1 < 0.1%
 
103 1 < 0.1%
 
107 1 < 0.1%
 
111 2 < 0.1%
 
ValueCountFrequency (%) 
254 133 0.9%
 
253 163 1.1%
 
252 152 1.0%
 
251 183 1.2%
 
250 167 1.1%
 

Hillshade_3pm
Real number (ℝ≥0)

Distinct count247
Unique (%)1.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean135.0919974
Minimum0
Maximum248
Zeros88
Zeros (%)0.6%
Memory size118.2 KiB

Quantile statistics

Minimum0
5-th percentile53
Q1106
median138
Q3167
95-th percentile207
Maximum248
Range248
Interquartile range (IQR)61

Descriptive statistics

Standard deviation45.89518871
Coefficient of variation (CV)0.3397328458
Kurtosis-0.08734390755
Mean135.0919974
Median Absolute Deviation (MAD)36.48636362
Skewness-0.3408272326
Sum2042591
Variance2106.368347
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 0.5 11.5 23.5 34.5 ... 199.5 218.5 225.5 238.5 248. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
143 182 1.2%
 
149 161 1.1%
 
132 156 1.0%
 
133 154 1.0%
 
142 154 1.0%
 
136 154 1.0%
 
137 152 1.0%
 
138 148 1.0%
 
154 148 1.0%
 
152 145 1.0%
 
Other values (237) 13566 89.7%
 
ValueCountFrequency (%) 
0 88 0.6%
 
1 1 < 0.1%
 
3 3 < 0.1%
 
4 1 < 0.1%
 
6 2 < 0.1%
 
ValueCountFrequency (%) 
248 2 < 0.1%
 
247 4 < 0.1%
 
246 4 < 0.1%
 
245 4 < 0.1%
 
244 3 < 0.1%
 
Distinct count2710
Unique (%)17.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1511.147288
Minimum0
Maximum6993
Zeros2
Zeros (%)< 0.1%
Memory size118.2 KiB

Quantile statistics

Minimum0
5-th percentile296.9
Q1730
median1256
Q31988.25
95-th percentile3663.05
Maximum6993
Range6993
Interquartile range (IQR)1258.25

Descriptive statistics

Standard deviation1099.936493
Coefficient of variation (CV)0.7278817235
Kurtosis3.385415788
Mean1511.147288
Median Absolute Deviation (MAD)818.8952133
Skewness1.617098874
Sum22848547
Variance1209860.288
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 0. 63.5 122. 211. 214. ... 3606.5 4318. 6229. 6623.5 6993. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
618 65 0.4%
 
541 51 0.3%
 
636 45 0.3%
 
607 43 0.3%
 
573 42 0.3%
 
960 42 0.3%
 
752 41 0.3%
 
942 40 0.3%
 
342 40 0.3%
 
242 40 0.3%
 
Other values (2700) 14671 97.0%
 
ValueCountFrequency (%) 
0 2 < 0.1%
 
30 9 0.1%
 
42 11 0.1%
 
60 10 0.1%
 
67 20 0.1%
 
ValueCountFrequency (%) 
6993 1 < 0.1%
 
6853 1 < 0.1%
 
6723 1 < 0.1%
 
6686 1 < 0.1%
 
6661 1 < 0.1%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
11523
1
3597
ValueCountFrequency (%) 
0 11523 76.2%
 
1 3597 23.8%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
14621
1
 
499
ValueCountFrequency (%) 
0 14621 96.7%
 
1 499 3.3%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
8771
1
6349
ValueCountFrequency (%) 
0 8771 58.0%
 
1 6349 42.0%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
10445
1
4675
ValueCountFrequency (%) 
0 10445 69.1%
 
1 4675 30.9%
 

Soil_Type1
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
14765
1
 
355
ValueCountFrequency (%) 
0 14765 97.7%
 
1 355 2.3%
 

Soil_Type2
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
14497
1
 
623
ValueCountFrequency (%) 
0 14497 95.9%
 
1 623 4.1%
 

Soil_Type3
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
14158
1
 
962
ValueCountFrequency (%) 
0 14158 93.6%
 
1 962 6.4%
 

Soil_Type4
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
14277
1
 
843
ValueCountFrequency (%) 
0 14277 94.4%
 
1 843 5.6%
 

Soil_Type5
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
14955
1
 
165
ValueCountFrequency (%) 
0 14955 98.9%
 
1 165 1.1%
 

Soil_Type6
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
14470
1
 
650
ValueCountFrequency (%) 
0 14470 95.7%
 
1 650 4.3%
 

Soil_Type7
Boolean

CONSTANT
REJECTED
Distinct count1
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
15120
ValueCountFrequency (%) 
0 15120 100.0%
 

Soil_Type8
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
15119
1
 
1
ValueCountFrequency (%) 
0 15119 > 99.9%
 
1 1 < 0.1%
 

Soil_Type9
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
15110
1
 
10
ValueCountFrequency (%) 
0 15110 99.9%
 
1 10 0.1%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
12978
1
 
2142
ValueCountFrequency (%) 
0 12978 85.8%
 
1 2142 14.2%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
14714
1
 
406
ValueCountFrequency (%) 
0 14714 97.3%
 
1 406 2.7%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
14893
1
 
227
ValueCountFrequency (%) 
0 14893 98.5%
 
1 227 1.5%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
14644
1
 
476
ValueCountFrequency (%) 
0 14644 96.9%
 
1 476 3.1%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
14951
1
 
169
ValueCountFrequency (%) 
0 14951 98.9%
 
1 169 1.1%
 

Soil_Type15
Boolean

CONSTANT
REJECTED
Distinct count1
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
15120
ValueCountFrequency (%) 
0 15120 100.0%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
15006
1
 
114
ValueCountFrequency (%) 
0 15006 99.2%
 
1 114 0.8%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
14508
1
 
612
ValueCountFrequency (%) 
0 14508 96.0%
 
1 612 4.0%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
15060
1
 
60
ValueCountFrequency (%) 
0 15060 99.6%
 
1 60 0.4%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
15074
1
 
46
ValueCountFrequency (%) 
0 15074 99.7%
 
1 46 0.3%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
14981
1
 
139
ValueCountFrequency (%) 
0 14981 99.1%
 
1 139 0.9%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
15104
1
 
16
ValueCountFrequency (%) 
0 15104 99.9%
 
1 16 0.1%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
14775
1
 
345
ValueCountFrequency (%) 
0 14775 97.7%
 
1 345 2.3%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
14363
1
 
757
ValueCountFrequency (%) 
0 14363 95.0%
 
1 757 5.0%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
14863
1
 
257
ValueCountFrequency (%) 
0 14863 98.3%
 
1 257 1.7%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
15119
1
 
1
ValueCountFrequency (%) 
0 15119 > 99.9%
 
1 1 < 0.1%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
15066
1
 
54
ValueCountFrequency (%) 
0 15066 99.6%
 
1 54 0.4%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
15105
1
 
15
ValueCountFrequency (%) 
0 15105 99.9%
 
1 15 0.1%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
15111
1
 
9
ValueCountFrequency (%) 
0 15111 99.9%
 
1 9 0.1%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
13829
1
 
1291
ValueCountFrequency (%) 
0 13829 91.5%
 
1 1291 8.5%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
14395
1
 
725
ValueCountFrequency (%) 
0 14395 95.2%
 
1 725 4.8%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
14788
1
 
332
ValueCountFrequency (%) 
0 14788 97.8%
 
1 332 2.2%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
14430
1
 
690
ValueCountFrequency (%) 
0 14430 95.4%
 
1 690 4.6%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
14504
1
 
616
ValueCountFrequency (%) 
0 14504 95.9%
 
1 616 4.1%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
15098
1
 
22
ValueCountFrequency (%) 
0 15098 99.9%
 
1 22 0.1%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
15018
1
 
102
ValueCountFrequency (%) 
0 15018 99.3%
 
1 102 0.7%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
15110
1
 
10
ValueCountFrequency (%) 
0 15110 99.9%
 
1 10 0.1%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
15086
1
 
34
ValueCountFrequency (%) 
0 15086 99.8%
 
1 34 0.2%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
14392
1
 
728
ValueCountFrequency (%) 
0 14392 95.2%
 
1 728 4.8%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
14463
1
 
657
ValueCountFrequency (%) 
0 14463 95.7%
 
1 657 4.3%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size118.2 KiB
0
14661
1
 
459
ValueCountFrequency (%) 
0 14661 97.0%
 
1 459 3.0%
 

Cover_Type
Real number (ℝ≥0)

Distinct count7
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4
Minimum1
Maximum7
Zeros0
Zeros (%)0.0%
Memory size118.2 KiB

Quantile statistics

Minimum1
5-th percentile1
Q12
median4
Q36
95-th percentile7
Maximum7
Range6
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.000066141
Coefficient of variation (CV)0.5000165352
Kurtosis-1.250016528
Mean4
Median Absolute Deviation (MAD)1.714285714
Skewness0
Sum60480
Variance4.000264568
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[1. 1.5 6.5 7. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
7 2160 14.3%
 
6 2160 14.3%
 
5 2160 14.3%
 
4 2160 14.3%
 
3 2160 14.3%
 
2 2160 14.3%
 
1 2160 14.3%
 
ValueCountFrequency (%) 
1 2160 14.3%
 
2 2160 14.3%
 
3 2160 14.3%
 
4 2160 14.3%
 
5 2160 14.3%
 
ValueCountFrequency (%) 
7 2160 14.3%
 
6 2160 14.3%
 
5 2160 14.3%
 
4 2160 14.3%
 
3 2160 14.3%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Missing values

Sample

First rows

df_indexIdElevationAspectSlopeHorizontal_Distance_To_HydrologyVertical_Distance_To_HydrologyHorizontal_Distance_To_RoadwaysHillshade_9amHillshade_NoonHillshade_3pmHorizontal_Distance_To_Fire_PointsWilderness_Area1Wilderness_Area2Wilderness_Area3Wilderness_Area4Soil_Type1Soil_Type2Soil_Type3Soil_Type4Soil_Type5Soil_Type6Soil_Type7Soil_Type8Soil_Type9Soil_Type10Soil_Type11Soil_Type12Soil_Type13Soil_Type14Soil_Type15Soil_Type16Soil_Type17Soil_Type18Soil_Type19Soil_Type20Soil_Type21Soil_Type22Soil_Type23Soil_Type24Soil_Type25Soil_Type26Soil_Type27Soil_Type28Soil_Type29Soil_Type30Soil_Type31Soil_Type32Soil_Type33Soil_Type34Soil_Type35Soil_Type36Soil_Type37Soil_Type38Soil_Type39Soil_Type40Cover_Type
01156311564280039152975816272192061182579100000000000000000000000000000001000000000002
1200420052764612425510725823118372834100000000000000000000000000000000100000000005
298459846351515827120-3732212372321061140001000000000000000000000000000000000001000007
38744874527104722212662184221185872341001000000000010000000000000000000000000000006
462896290251592252687291124919155626000110000000000000000000000000000000000000003
5130713083151266151271656641812452032460100000000000000000000000010000000000000000002
6276527662073981315015698241222109849000101000000000000000000000000000000000000003
7564956502247112001189217234155713000101000000000000000000000000000000000000004
869096910274840202016011952171911001624001000000000010000000000000000000000000000002
9343234332185121281508581925420559201000100100000000000000000000000000000000000004

Last rows

df_indexIdElevationAspectSlopeHorizontal_Distance_To_HydrologyVertical_Distance_To_HydrologyHorizontal_Distance_To_RoadwaysHillshade_9amHillshade_NoonHillshade_3pmHorizontal_Distance_To_Fire_PointsWilderness_Area1Wilderness_Area2Wilderness_Area3Wilderness_Area4Soil_Type1Soil_Type2Soil_Type3Soil_Type4Soil_Type5Soil_Type6Soil_Type7Soil_Type8Soil_Type9Soil_Type10Soil_Type11Soil_Type12Soil_Type13Soil_Type14Soil_Type15Soil_Type16Soil_Type17Soil_Type18Soil_Type19Soil_Type20Soil_Type21Soil_Type22Soil_Type23Soil_Type24Soil_Type25Soil_Type26Soil_Type27Soil_Type28Soil_Type29Soil_Type30Soil_Type31Soil_Type32Soil_Type33Soil_Type34Soil_Type35Soil_Type36Soil_Type37Soil_Type38Soil_Type39Soil_Type40Cover_Type
1511073687369341756113426828482272181231380001000000000000000000000000000000000000001007
151115035503622073472842171648154181157192000100000000000001000000000000000000000000006
1511210804108052625478170331784223223134979001000100000000000000000000000000000000000003
15113801980202578352771915002132301551690001001000000000000000000000000000000000000005
1511418871888274428839562282112391661187100000000000000000000000000000000100000000005
15115300230032339106292104974125419245525000100000100000000000000000000000000000000003
15116156615673262885242106300228231137420100000000000000000000000010000000000000000001
151172221222227343428201104132320416885664001000000000000000000000000000000000100000006
151186119612023023127001350201235173836000100000000000000001000000000000000000000006
15119974497453107300102833230681922361841613001000000000000000000000000000000010000000002